Development Of An Automatic Trend Exploration System Using The MuST Data Collection

نویسندگان

  • Masaki Murata
  • Koji Ichii
  • Qing Ma
  • Tamotsu Shirado
  • Toshiyuki Kanamaru
  • Sachiyo Tsukawaki
  • Hitoshi Isahara
چکیده

The automatic extraction of trend information from text documents such as newspaper articles would be useful for exploring and examining trends. To enable this, we used data sets provided by a workshop on multimodal summarization for trend information (the MuST Workshop) to construct an automatic trend exploration system. This system first extracts units, temporals, and item expressions from newspaper articles, then it extracts sets of expressions as trend information, and finally it arranges the sets and displays them in graphs. For example, when documents concerning the politics are given, the system extracts “%” and “Cabinet approval rating” as a unit and an item expression including temporal expressions. It next extracts values related to “%”. Finally, it makes a graph where temporal expressions are used for the horizontal axis and the value of percentage is shown on the vertical axis. This graph indicates the trend of Cabinet approval rating and is useful for investigating Cabinet approval rating. Graphs are obviously easy to recognize and useful for understanding information described in documents. In experiments, when we judged the extraction of a correct graph as the top output to be correct, the system accuracy was 0.2500 in evaluation A and 0.3334 in evaluation B. (In evaluation A, a graph where 75% or more of the points were correct was judged to be correct; in evaluation B, a graph where 50% or more of the points were correct was judged to be correct.) When we judged the extraction of a correct graph in the top five outputs to be correct, accuracy rose to 0.4167 in evaluation A and 0.6250 in evaluation B. Our system is convenient and effective because it can output a graph that includes trend information at these levels of accuracy when given only a set of documents as input.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Development of an Automatic Land Use Extraction System in Urban Areas using VHR Aerial Imagery and GIS Vector Data

Lack of detailed land use (LU) information and efficient data collection methods have made the modeling of urban systems difficult. This study aims to develop a novel hierarchical rule-based LU extraction framework using geographic vector and remotely sensed (RS) data, in order to extract detailed subzonal LU information, residential LU in this study. The LU extraction system is developed to ex...

متن کامل

Pic Microcontroller-Based Automatic Meter Reading (AMR) System Using the Low Voltage (LV) Power Line Network (TECHNICAL NOTE)

Automatic Meter Reading (AMR) is the remote collection of consumption data from customer’s utility meters over telecommunications, radio, power line and other links. AMR provides water, electric and gas utility service companies the opportunity to streamline metering, billing and collection activities, increase operational efficiency and improve customer service. The AMR system consists of thre...

متن کامل

AN-EUL method for automatic interpretation of potential field data in unexploded ordnances (UXO) detection

We have applied an automatic interpretation method of potential data called AN-EUL in unexploded ordnance (UXO) prospective which is indeed a combination of the analytic signal and the Euler deconvolution approaches. The method can be applied for both magnetic and gravity data as well for gradient surveys based upon the concept of the structural index (SI) of a potential anomaly which is relate...

متن کامل

تبیین و رتبه بندی شاخص های ساختاری مؤثر بر فرآیند کارآفرینی با استفاده از رویکرد دلفی فازی و AHP فازی در شیلات استان مازندران

Organizational entrepreneurship is the process by which organizations identify opportunities for growth and development and create new value for customers through innovation and reallocation of resources. Meanwhile, the structural indicators are the crucial and influential factors on the development of the organizational entrepreneurship process. The purpose of the present research is the verif...

متن کامل

A Reliable and Economically Feasible Automatic Meter Reading System Using Power Line Distribution Network (TECHNICAL NOTE)

Automatic Meter Reading (AMR) is the remote collection of consumption data from customer’s utility meters over telecommunications, radio, power line and other links. AMR provides water, electric and gas utility−service companies the opportunities to streamline metering, billing and collection activities, increase operational efficiency and improve customer service. Utility company uses technolo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006